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1 

METHOD AND APPARATUS FOR TRANSFERRING NO N-SPEECH DATA 

IN VOICE CHANNEL 

Field of the Invention 

The present invention relates generally to a mobile communication 
method and apparatus, and more particularly, to a communication method 
and apparatus for transferring non-speech data over voice channel in cellular 
mobile telecommunication system s. 

Background Art of the Invention 

In current 2G/3G mobile communication systems, speech signals and 
non-speech data are transferred respectively , with speech signals via voice 
channel and non-speech data via dedicated data channel . 

The processing flow chart of transferring speech signals between two 
existing GSM mobile terminals Is shown in Fig.1. As illustrated in the figure, 
before being transmitted to the network system, the speech signal to be 
transmitted at the mobile terminal as sender side, is anaiog-to-digital 
converted by ADC 10, speech -compressed by speech compression unit 20, 
channel-coded by channel coding unit 30 and modulated by modulation & Tx 
unit 40 in Tx RSS (Transmitter Radio Subsystem ) 93. While at the mobile 
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terminal as receiver side, the received speech signal from the network 
system is demodulated by Rx & demodulation unit 50 and channel -decoded 
by channel decoding unit 60 in Rx RSS (Receiver Radio Subsystem) 96, 
then speech -decompressed by speech decompression unit 70, and 
digital-to-analog converted by DAC 80. Thus, at last, the original speech 
signal transmitted by the mobile terminal as sender side is recovered after 
the aforementioned processing ste ps. 

Generally speaking, in a common dialog procedure, the behavior of 
either side of the communication includes two states: speaking and 
non-speaking. Based on this fact, a mobile terminal will transmitted the 
signal discontinuously in GSM mobile communica tion system. In another 
word, besides transmitting speech signals containing speech information, the 
mobile terminal also selectively transmits signals only containing background 
noise without speech information, to reduce signal interference and save 
energy. (That means the mobile terminal selectively transmits signals only 
containing background noise without speech information, besides transmits 
speech signals containing speech information normally, so as to reduce 
signal interference and save energy.). To implement discontinuous 
transmission mechanism in a mobile terminal. GSM mobile communication 
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system uses a discontinuous transmission handler at the transmitter side 
(abbr. as Tx DTX handler later) In speech compression unit 20 to achieve 
discontinuous speech transmission, and a discontinuous transmission 
handler at the receiver side (abbr. as Rx DTX Handler) in speech 
decompression unit 70 to achieve discontinuous speech receiving. 

Fig.2 is a block diagram illustrating current speech processing unit used 
in GSM full-rate speech traffic. The speech processing unit comprises thJe 
functional block of speech compression unit 20 used for transmitting data, as 
well as the functional block of speech decompression unit 70 used for 
receiving data . Additionally, ADC 10. Tx RSS 93, Rx RSS 96 and DAC 80 are 
all included in Fig.2 as well, to describe the complete procedure for 
transmitting/receiving speech signal. 

As illustrated in Flg.2, Tx DTX handler 90 for transmitting data comprises: 
speech encoder 901 (defined in GSM 06.10 standard). Tx DTX control & 
operation unit 902 (defined in GSM 06.31 standard), VAD (voice activity 
detector) 903 { defined in GSM 06.32 standard) and Tx comfort noise unit 
904 (defined in GSM 06.12 standard). Meanwhile, Rx DTX handler unit 100 
for receiving data comprises: Rx DTX control & operation unit 1001 (defined 
in GSM 06.31 standard), speech decoder 1002 (defined in GSM 06.10 
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Standard), speech frame substitution unit 1003 (defined in GSM 06.11 
standard) and Rx comfort noise unit 1004 (defined in GSM 06.12 standard). 

A detailed description will be given below to the method for transmitting 
and receiving speech signal based on discontinuous transmission 
mechanism in GSM full -rate speech traffic, in conjunction with Fig.2. 

When a mobile terminal transmits speech signal, ADC 10 converts the 
analog speech signal to be transmitted into a 13 -bit uniform PCM (Pulse 
Code Modulated) digital speech signal with 6000 samples per second, and 
delivers the digital speech signal to Tx DTX handler unit 90. 

Speech encoder 901 in Tx DTX handler 90 receives the digital speech 
signal from ADC 10, and compresses the received signal Into speech frames 
suitable for transmission over radio links (50 frames/second and 260 
bit/frames for example), then sends the speech frames to Tx DTX control & 
operation unit 902. The speech frame including speech information from 
speech encoder 901 Is 20ms in duration. If the speech frame only has 
background noise included, without speech information, the frame is called 
as silence speech frame. 

VAD 903 detects the digital speech signal from ADC 10 to determine 
whether the speech frame contains speech information. If speech 
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information is contained, the VAD flag is set to 1; if the frame only contains 
background noise without speech i nformation, the VAD flag is set to 0. 

Tx comfort noise unit 904 detects the speech frame from speech 
encoder 901. If N consecutive silence speech frames are detected, the 
background noises embedded in the N consecutive silence speech frames 
will be summed and averaged, to get the average background noise. Then, 
the background noise parameters can be computed from the average 
background noise and encoded into a special frame, i.e. the silence 
description frame (SID frame, with 260 bits/frame). Finally, the S ID frame 
containing background noise parameters is transmitted to Tx DTX control & 
operation 902. 

Wherein: 

First, in Tx comfort noise unit 904, the number N of consecutive silence 
speech frames used for computing the average background noise, is called 
the number of silence speech frames for computing the SID frame. The value 
of N is 4 in GSM full -rate speech traffic while is 8 in GSM half-rate speech 
fraffic. 

Second, the silence speech frame generated in Tx comfort noise unit 
904 is 20ms in duration, and o nly contains the parameter for generating 
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background noise in tfie mobile terminal as receiver, but without speech 
information. 

Since the speech frame from speech encoder 901 has the same 
duration as the SID frame from Tx comfort noise unit 904, some specif ic bits 
in the frame can be defined as SID code word, to distinguish different types 
of frames. For the SID frame, its SID code word is all 0. while the SID code 
word is not all 0 for the speech frame. Therefore, a frame can be determined 
as speech frame o r SID frame by detecting the SID code, that is, by checking 
the value of said specific bits. 

For example, in the GSM full -rate speech traffic as illustrated in Fig.3. 
every 20ms frame is composed of 260 bits, in which 36 bits are used for LAR 
(Log Area Ratios) and the other 224 bits for 4 sub -frames. Each sub-frame is 
composed of 56 bits, in which 7 bits are used for LTP (Long Term Prediction) 
lag parameter. 2 bits for LTP gain parameter, 2 bite for RPE (Regular Pulse 
Excitation) grid position parameter, 6 bits for Block Amplitude parameter and 
39 bite for RPE -pulse Xmc parameter. 

In Fig.3, the 156 -bit Xmc parameters (39bits/subframeM subframes=156 
bite) is made up of RPE -pulse X^c of each sub -frame, in which 95 bits of 
RPE-pulse Xmc drawn out from the total 156-bit X^c parameter sets, are used 
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as a SID code word and the remained 61 bits are reserved for future use. If 
the SID cxide word equals to zero, the frame is a SID frame, otherwise, it is a 
speech frame. 

On receipt of speech frames from speech encoder 901 or SID frames 
from Tx comfort noise unit 904, Tx DTX control & operation unit 902 checks 
the state of the VAD flag. If the VAD flag is 1 , Tx DTX control & operation unit 
902 sets the SP (speech period) flag to 1 and sends the speech frame from 
speech encoder 901 to Tx RSS 93. If VAD changes to 0 from 1, It indicates 
that the speech burst is over, then Tx DTX control & operation unit 902 
checks N elapsed, which is the number of speech frames sent to Tx RSS 93 
since the latest SID frame update., to determi ne whether the hangover 
procedure needs to be enabled. 

Herein, hangover procedure is a mandatory mechanism for updating 
SID frame. Without the hangover procedure, after sending a speech burst, a 
mobile terminal then sends a SID frame generated before said speech burst. 
There will be no problem if the speech burst is very short. But if the speech 
burst is very long, the background noise embedded in the SID frame 
generated before this long speech burst, may be significantly different from 
that generated when the speech burst ends. If the mobile terminal as 
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receiver generates background noise by using the SID frame generated 
before the long speech burst, mismatching background noise will be 
produced, and it can be very uncomfortable for the mobile temriinal as 
receiver. In order to overcome this deficiency, a mobile terminal first sends N 
silence speech frames to the mobile terminal as receiver after the long 
speech burst, and then sends a new SID frame containing the latest 
background noise parameters generated according to these silence speech 
frames. 

Descriptions will be given below to the operations of DTX handler unit 
90, with regard to enabling hangover procedure and not enabling hangover 
procedure respectively, in conjunction with Fig.4 and Fig. 5. 

1. Not enabling hangover procedure, wherein N elapsed, the number of 
speech frames sent out. is not more than the predefined number of speech 
frames required for enabling hangover procedure. 

Tx DTX control & operation unit 902 won 't enable a hangover 
procedure, and sets the SP flag to 0. and then fonA/ards the SID frame from 
Tx comfort noise unit 904 to Rx RSS 93. 

Fig.4 is a schematic diagram illustrating the discontinuous transmission 
mechanism In current GSM full -rate speech trafllcwhen hangover procedure 
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is not enabled. As the figure illustrates, VAD 903 detects there is some 
speech information in the speech signal during the speech burst, and thus 
sets the VAD flag to 1; when the speech burst is over, no speech information 
can be detected, thus VAD 903 sets the VA D flag to 0. When the VAD flag is 
1, Tx DTX control & operation unit 902 sets the SP flag to 1, and forwards the 
speech frames from speech encoder 901 to Tx RSS 93. When the VAD flag 
changes from 1 to 0, Tx DTX control & operation unit 902 checks N ©lapsed, the 
number of speech frames sent to the Tx RSS since the last SID frame 
update, and finds N eiapsed=22. This means it is not more than the predefined 
threshold of the hangover procedure, and thus the hangover procedure 
needn't be enabled. Therefore, Tx DTX control & operation unit 902 sets the 
SP flag to 0, and forwards the SID frame from Tx comfort noise unit 904 to Tx 
RSS 93. Wherein, the number of silence speech frames used for computing 
the SID frame is 4, so Tx comfort noise unit 904 generates a new SI D frame 
after detecting 4 consecutive silence speech frames, hence the first 4 SID 
flrames (for example, SID k as shown in the figure) sent to Tx RSS 93 after the 
end of the speech burst are generated before the speech burst, and the fifth 
and consequent frames thereafter (for example, SiDk+i as shown in the 
figure) are new SID frames generated after the speech burst 
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2. Enabling hangover procedure, wherein N elapsed, the number of 
speech frames sent out. is greater than the predefined number of speech 
frames required for enabling hangover procedure. 

Tx DTX control & operation unit 902 enables the hangover procedure, 
that is, Tx DTX control & operation unit 902 first continues to set the SP flag 
to 1, and forwards the consecutive silence speech frames from spee ch 
encoder 901. having the same number as the silence speech frames used 
for computing the SID frame, to Tx RSS 93. Then, the SP flag is set to 0. and 
the new SID frame from Tx comfort noise unit 904 and generated according 
to said silence speech frames, i s sent to Tx RSS 93. 

Fig.5 is a schematic diagram illustrating the discontinuous transmission 
mechanism in current GSM full -rate speech traffic when the hangover 
procedure is enabled. As illustrated in the figure, VAD 903 detects that there 
is some speech information contained in the speech signal during the 
speech burst, and sets the VAD flag to 1 ; when the speech burst is over, VAD 
903 detecte there is no speech information contained in the speech signal, 
so VAD 903 sets the VAD flag to 0. When the VAD fl ag is 1 , Tx DTX control & 
operation unit 902 sets the SP flag to 1, and fonwards the speech frames 
from speech encoder 901 to Tx RSS 93. When the VAD flag changes flrom 1 
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to 0. Tx DTX control & operation unit 902 checks N elapsed, tlie number of 
speech frames sent to Tx RSS 93 since the last SID frame update, and finds 
Neiapsed=50, greater than predefined threshold for hangover procedure, thus 
the hangover procedure is enabled. That means, the SP flag is set to 1 first, 
and 4 consecutive silence speech frames from speech encoder 901 are sent 
to Tx RSS 93. Then, the SP flag is set to 0, and the new SID frame (for 
example, SIDr+i as shown in the figure) from Tx comfort noise unit 904 and 
generated according to said 4 silence speech frames, is sent to Tx RSS 93. 

On receipt of the frames from Tx DTX control & operation unit 902, Tx 
RSS 93 checks the state of the SP flag. If the flag SP is detected to be 1 . it 
will forward the frame from Tx DTX control & operation unit 902 to a mobile 
terminal as receiver. If the SP flag is found to be 0, the SID code word of the 
frame from Tx DTX control & operation unit 902 will be checked. If the SID 
code word equals to 0, that is, the frame is a SID frame, it will send the frame 
to the network system and switches to idle state. 

Rx RSS 96 in the mobile terminal as receiver receives frames 
transmitted from the mobile terminal as sender via the network system, and 
forwards the received frame to Rx DTX control & operation unit 1001. and 
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sets the BFI (bad frame indicator) flag, SID fla g and TAF (time alignment flag) 
flag meanwhile: 

(1) If the received frame is a perfect speech frame, set BFI=0 and SID=0; 

(2) if the received frame is a bad speech firame, set BFI=1 and SID=0; 

(3) if the received frame is a perfect SID frame, set BFI=0 and SID=2, and 
mark the position of the SID frame in the multiple SACCH (slow 
associated control channel) frames in the TAF flag; 

(4) if the received firame is a bad SID firame, set BFI=1 and SID=1. 

On receipt of frames from Rx RSS 96. Rx DTX control & operation unit 
1001 performs relevant processing on the received firame according to the 
states of BFI, SID and TAF: 

(1) if BFI=0 and SID=0. it indicates that the received frame is a perfect 
speech frame, thus the perfect speech frame will be sent to speech decoder 
1002; 

(2) if BFI=1 and SID=0, it indicates that the received frame is bad speech 
frame, speech frame substitution unit 1003 will be instructed to generate a 
perfect speech firame; 

(3) if BFI=0 and SID=2, it indicates that the received firame is a perfect 
SID frame, the perfect SID frame will be sent to Rx comfort noise unit 1004; 
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(4) If BFI=1 and SID=1, it indicates that the received frame is a bad SID 
frame, speech frame substitution unit 1003 will be instructed to compute the 
bacl<ground noise parameters. 

Speech decoder 1002 receives perfect speech frames from Rx DTX 
control & operation unit 1001, then gets digital speech signals by speech 
decoding the perfect speech frames, and sends the digital speech signals to 
DAC 80. 

Rx comfort noise unit 1004 receiv es the perfect SID frames from Rx 
DTX control & operation unit 1001, then extracts background noise 
parameters from the perfect SID frames and sends the parameters to speech 
decoder 1002 to generate background noise. 

Speech frame substitution unit 1003 gen erates perfect speech frames or 
computes background noise parameters according to the instructions of Rx 
DTX control & operation unit 1001, and then sends the generated speech 
frames or background noise parameters to speech decoder 1002. 

DAC 80 receives the digital speech signal from speech decoder 1002, 
then converts the digital speech signal into analog speech signal and sends 
it to the corresponding processing unit. 

Detailed descriptions have been offered to the transmission and 



wo 2005/048620 



PCT/IB2004/052319 



14 

receiving of speech signals based on discontinuous transmission 
mechanism in GSIVI communication system, in conjunction witli Fig.2, Fig.3 
and Fig.4. 

Just as stated, wlien tlie speecli burst is over, the Tx DTX control & 
operation unit sends SID frames, or silence speech frames and SID frames 
to the Tx RSS. Since there is no speech information in SID frames and 
silence speech frames, if data frames containing non -speech data rather 
than silence speech frames or SID frames are sent when silence speech 
frames or SID frames are supposed to be sent, non-speech data can be sent 
via voice channel \Anthout affecting the delivery of speech Infomiation. Thus, 
no dedicated data channel is needed for the delivery of non -speech data, 
which will significantly save radio resources for communication and eliminate 
the time for establishing dedicated data channels. 

Summary of the Invention 

The object of present Invention is to provide a method and apparatus for 
transferring non -speech data over voice channel in mobile communication 
systems. In the proposed method and apparatus, non -speech data can be 
transferred via voice channel, instead of dedicated data channel, through 
modifying the SID frame or silence speech frame, and thus the radio 
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resource can be greatly saved. 

A method is proposed for a mobile term inal to transmit non-speech data 
via voice channel in mobile communication systems in accordance with the 
present invention, comprising: encapsulating the non -speech data to be 
transmitted into in -band data (IBD) frames; storing the IBD frames in a buffer; 
detecting whether the speech burst sent to another mobile terminal is over; 
checking whether there is any non -speech data to be sent to said another 
mobile terminal if the speech burst is over; sending at least one non -speech 
data frame to said another mo bile terminal via voice channel, if there is 
non-speech data to be sent. 

A method is proposed for a mobile terminal to transmit non-speech data 
via voice channel in accordance with the present invention , comprising: 
detecting the frame received from anothe r mobile terminal; if the received 
frame is an in-band data (IBD) frame, storing the IBD frame in a buffer; 
generating background noise by taking advantage of the previously received 
SID frames. 

IBD frames generated in accordance with the present invention , can be 
classified into three types: (1) the IBD code word for marking the IBD frame 
is composed of the SID code word for marking the SID frame, and the value 
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Of each bit selected from the bits that form the SID code word, for 
differentiating the IBD code word from the SID code word, can 't be the same 
as that of each bit for marking the SID code word; (2) the IBD code word for 
marking the IBD fi^ame is composed of all bits for carrying Block Amplitude 
parameter and at least one bit selected from the SID cod e word for marking 
the SID frame, and the value of each bit for carrying Block Amplitude 
parameter is zero, and the value of each bit selected from the SID code word 
can't be the same as that of each bit for marking the SID code word; (3) the 
IBD code word for marking the IBD frame is composed of the SID code word 
for marking the SID frame and at least one reserved bit not included in the 
SID code word. 

Brief descriptions of the Accompanying Drawings 

Fig.1 is a schematic diagram illustrating speech signal transmission 
between two cunrent GSM mobile terminals ; 

Fig.2 is a block diagram illustrating current speech processing unit used 
in GSM full -rate speech traffic; 

Fig.3 is a schematic diagram illustrating current speech frame used in 
GSM full -rate speech traffic; 

Fig.4 is a schematic diagram illustrating the current discontinuous 
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transmission mechanism in GSM full -rate speech traffic when the hangover 
procedure is not enabled; 

Rg.5 is a schematic diagram illustrating the discontinuous transmission 
mechanism in GSM full -rate speech traffic when the hangover procedure is 
enabled; 

Fig.6 is a schematic diagram illustrating an embodiment of the first type 
of IBD frame in the present invention; 

Fig.7 is a schematic diagram illustrating an embodiment of the second 
type of IBD frame in the present invention; 

Fig.8 is a schematic diagram illustrating an embodiment of the third type 
of IBD frame in the present invention; 

Fig.9 is a block diagram illustrating the speech processing unit used in 
GSM full-rate speech traffic in accordance with the present invention; 

Fig.lOA and Fig.lOB are flow charts illustrating the transmission of the 
first type of IBD frames in accordance with the present Invention when the 
hangover procedure is not enabled; 

Fig.llA and Fig.HB are flow charts illustrating the transmission of the 
second type of IBD frames in accordance with the present invention when 
the hangover procedure is not enabled; 
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Fig.12A and Fig.12B are flow charts illustrating the transmission of the 
third type of IBD frames in accordance with the present invention when the 
hangover procedure is not enabled; 

Fig. 13 illustrates an embodiment of transmitting the first (second) type 
of IBD frames in the present invention when the hangover procedure is not 
enabled; 

Fig.14 Illustrates an embodiment of transmitting the third type of IBD 
frames in the present invention when the hangover procedure is not enabled; 

Fig.15A and Fig.15B are flowcharts illustrating the transmission of the 
first type of IBD frames in accordance with the pres ent invention when the 
hangover procedure is enabled; 

Fig.16A and Fig.16B are flow charts illustrating the transmission of the 
second type of IBD frames in accordance with the present invention when 
the hangover procedure Is enabled; 

Fig.17A and Fig.17B are flow charts illustrating the transmission of the 
third type of IBD frames in accordance with the present invention when the 
hangover procedure is enabled; 

Fig. 18 illustrates an embodiment of transmitting the first (second , third) 
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type of IBD frames in the present invention when the hangover procedure is 
enabled. 

Detailed Description of the Invention 

In the present invention, non-speech data transferred via voice 
channel instead of dedicated data channel , is called as IBD (In-Band Data). 
A method is proposed for transferring IBD via voice channel, focusing on 
constructing three types of IBD frames for transferring non -speech data, 
based on conventional frames in mobile communication services. The three 
frames have different structures from the speech frame s and SID frames in 
current speech traffic, but of the same length. Thus, transmission of 
non-speech data can be realized without making significant modifications to 
the hardware equipments of current network system and mobile terminal 
hardware in mobile communication. 

GSM full -rate speech traffic will be exemplified below, to elaborate the 
three types of IBD frames constructed based on the frame structure as 
illustrated in Fig.3, in connection with Fig.6. Fig.7 and Fig.8. 

1. The first type of IBD frame 

The first type of IBD frame is marked by the 95 bits of the SID code 
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word. In the 95 bits. M bits are defined as the IBD code word postfix and the 
remained (95-M) bits are defined as the IBD code word prefix. That is: 
IBD code word = IBD code word p refix + IBD code word postfix 
In the IBD code word, all the (95 -M) bits as the prefix are zero, but the 
M bits as IJie postfix can't be zero at the same time. The IBD code word 
constructed in this way occupies the same bits as the SID code word, but the 
two types of code words can be differentiated according to whether the IBD 
code word postfix is zero or not. That is to say, the proposed method 
identifies a received firame as speech frame, SID frame or data flrame. only 
through checking the 95 -bit IBD code word in this frame. If the value of the 
IBD code word is zero, i.e. the prefix and postfix of the IBD code are both 
zero, the frame is a SID frame; if the value of the IBD code word is not zero 
and the IBD code word prefix is not zero, the flrame is a speech firame; only 
when the IBD code word is not zero and the prefix of the IBD frame word is 
zero while the postfix is not zero, the frame is an IBD frame. 

Since construction of the IBD frame has occupied M bits of the 95 bits for 
marking the SID frame in current frame, in o rder to transfer non -speech data 
using the first type of IBD flrame, it must be guaranteed that normal speech 
information is included in the speech flrame generated by the speech 
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encxKler, and the probability of the (95 -M) bits all being zero is equal to the 
probability of the 95 bits all being zero. In other words, it should be 
guaranteed during communication that the probability of the (95 -M) bits 
being used for speech frame is equal to the probability of the 95 bits being 
used for speech frame, that is, the frame consisting of the (95 -M) bits of zero 
value and M bits of non -zero, can only be an IBD frame rather than a speech 
frame in the present invention. As long as the value of M is chosen carefully, 
i.e., not too large, this condition can be satisfied easily. 

Fig.6 illustrates an embodiment of the above first type of IBD code 
word. In this embodiment, 3 bits out of the 95 bits are taken out to mark the 
IBD frame, that is M=3 (as to which 3 bits are selected, it is to be decided 
according to the speech encoder). As shown in the figure, it's assumed that 
bit 0 to bit 91 are defined as the IBD code word prefix while bit 92 to bit 94 as 
the IBD code word postfix. Depending on the value of bits 92,93 and 94, at 
most 7 kinds of the fii^t type of IBD frames can be defined in the embodiment 
shown in Fig.6. For example, we can define the IBD request frame as the 
one whose bits 92,93 and 94 are 0, 0 and 1 respectively, i.e. the one whose 
IBD code word prefix equals to 0 and IBD code word postfix equals to 1; and 
the IBD response frame as the one whose bits 92,93 and 94 are 0, 1 and 0 
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respectively, i.e. tlie one wliose IBD code word prefix equals to 0 and IBD 
code word postfix equals to 2. 

As illustrated above, transmission of non -speech data via voice 
channel can be realized by exploiting the first type of IBD frame without 
affecting current speech communication services, through redefining the bits 
in current SID code word. 

2. The second type of IBD frame 

As illustrated in the above description with Fig.3. there is the 6 -bit Block 
Amplitude parameter for denoting the amplitude of the speech signal in every 
sub-frame of a frame. No matter In speech frame or SID frame, the Block 
Amplitude parameter of the fi-ame, constructed by Block Amplitude 
parameters of 4 sub -frames (totally 24 bits), should not be all zero. 

The second type of In -Band Data frame, or namely the second type of 
IBD frame, can be identified by the above Block Amplitude parameter. 
Specifically, if the Block Amplitude parameter of a frame is zero, the frame is 
the second type of IBD frame; othenwise, the frame is speech frame or SID 
frame. 

But in this way. only one kind of the second type of IBD frame can be 
defined at the data link layer. If the second type of IBD frames of different 
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meanings are required to be transferred, ftirtlier settings need to be made. 

In a speech traffic, regarding a frame constructed by 260 bits, if the 156 
bits of the SID code word and the 24 bits of the Block Amplitude parameter 
are both zero simultaneously, the mobile terminal as receiver will regard this 
frame as a SID frame and provides it to the Rx comfort noise unit to generate 
background noise, because the SID code word is zero. Meanwhile, the Block 
Amplitude parameter is zero, thus this may result in sharp change in the 
background noise at the mobile terminal as receiver. Therefore, the SID code 
word and Block Amplitude parameter can 't be both zero simultaneously in a 
frame. According to this specification, in upper -layer protocols, some bits- in 
the SID code word can be used to identi fy the second type of IBD frames of 
different meanings, along with the above Block Amplitude parameter. 

According to the proposed second type of IBD frame, the Block 
Amplitude parameter of 24 bits is defined as the IBD code word postfix, and 
the bits drawn out of the SID code word for marking IBD frames of different 
meanings, are defined as the IBD code word prefix. That is: 

IBD code word = IBD code word prefix + IBD code word postfix 

Herein, the 24 bits as the IBD code word postfix are zero, while each bit 
in the prefix can't be zero simultaneously, T he IBD code word constructed in 
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this way can differentiate SID frame, speech frame and IBD frame distinctly, 
as well as implement transmission of IBD frames with different meanings via 
voice channel, through co mbining some bits in the SID code word with the 
Blocl< Amplitude parameter. 

Flg.7 illustrates an embodiment of the above second type of IBD code 
word. In this embodiment, 8 bits are drawn out of the SID code word to 
construct the prefix of the second type of IBD code word. As shown in the 
figure, it's assumed that bit 0 to bit 7 are defined as the IBD code word prefix 
while bit 8 to bit 31 are defined as the IBD code word postfix. Since the value 
of bit 8 to bit 31 is zero, at most 255 kinds of the second ty pe of IBD frames 
can be defined in the embodiment of Fig.7. according to the value of bit 0 to 
bit 7. For example, the IBD request frame can be defined as the one whose 
IBD code word prefix equals to 1 and postfix equals to 0, while the IBD 
response frame can be defined as the one whose IBD code word prefix 
equals to 2 and postfix equals to 0. 

As illustrated above, transmission of non -speech data via voice 
channel can be realized by using the proposed second type of IBD firame, 
through setting the Block AmpI itude parameter to zero and in combination 
with some bits in current SID code word. And furthermore, current speech 
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communication services won 't be affected. 
3. The third type of IBD frame 

The third type of In -Band data frame, or namely the third type of IBD 
frame, is composed of the 95 bits in SID code word and some reserved bits. 
The reserved bits are called as extended IBD code word, and they can be 
the bits reserved but not used yet in the SID frame. For example, in GSM 
full-rate speech traffic, the SID code word only occupies 95 bits of the 156-bit 
Xmc and the remained 61 bits are reserved for future use. The third type of 
IBD frame can be identified through using some of the reserved bits as the 
extended IBD code word. That is: 

IBD code word=SID code word + extended IBD word 

Fig.8 illustrates an embodiment of the above third type of IBD code 
word. In this embodiment, the extended IBD code word is defined as the first 
bits of the 68* parameter to the 76*** parameter in X^c (totally 9 bits). 
According to the 9 bits from Bit 0 to Bit 8. the IBD code word in Fig.8 can 
define (2 ®-1 =51 1 ) kinds of the third type of I BD frames at most. For example, 
the IBD request frame can be defined as the one whose extended IBD code 
word equals to 1 and SID code word equals to 0, while the IBD response 
frame can be defined as the one whose extended IBD code word equals to 2 
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and SID code word equals to 0. 

The three types of the proposed IBD frames are described above In 
detail In conjunction with Fig.6 to Fig.8. To implement transmission of the 
three types of IBD frames via voice channel, some modifications have to be 
made to the speech processing unit of current mobile terminals . The 
modified speech processing unit will be elaborated below, in conjuncfion with 
Fig.9. 

(1) In Tx DTX Handler Unit 90. a sending buffer 905 is added for storing 
the IBD frames to be transmitted, and a sending IBD flag SendlBDFlag is 
also added for indicating whether there are IBD frames stored in sending 
buffer 905. In this way, when upper -layer applications have some 
non-speech data to be transmitted, the mobile terminal encapsulate s the 
non-speech data into IBD frames and stores them in sending buffer 905, and 
then sets SendlBDFlag to 1, to notify Tx DTX control & operation unit 902 
that there are some frames to be transmitted in sending buffer 906 . After Tx 
DTX control & operation unit 902 sends all IBD frames stored in sending 
buffer 905 to Rx RSS 93, the mobile terminal sets SendlBDFlag to 0, to notify 
Tx DTX control & operation unit 902 that sending buffer 905 is empty. 

(2) In Rx DTX handler unit 100, a receiving buffer 1005 is added for 
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Storing the received IBD frames, and a receiving IBD flag Receive IBDFlag is 
also added for indicating whetlier there are IBD frames stored in receiving 
buffer 1005. In this way, when Rx DTX Control and Handle Unit 1001 
receives IBD frames, the mobile terminal store s the IBD frames in receiving 
buffer 1005, and meanwhile sets Receive IBDRag to 1, to notify upper-layer 
applications that there are IBD frames received in receiving buffer 1005. 
When upper-layer applications take out all the IBD frames stored in buffer 
1005, the mobile terminal sets Receive IBDFlag to 0, to notify upper-layer 
applications that receiving buffer 1005 is empty. 

(3) The queuing algorithm in Tx DTX control & operation unit 902 is 
modified, so that IBD frames can be sent to the Tx RSS. 

(4) Rx DTX control & operation unit 1001 is modified to identify the 
received IBD frames. 

(5) The upper-layer applications in the mobile terminal is provided with a 
data interface for reading or writing IBD frames, and thus upper -layer 
applications can write IBD frames into sending buffer 905 and read IBD 
frames from receiving buffer 1005 via the data interface. 

As explained in the above modifications, besides the dat a interface, 
sending buffer 905, SendlBDFlag, receiving buffer 1005, and 
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ReceivelBDFIag are added herein, the present invention only recommends 
modifications to Tx DTX control & operation unit 902 and Rx DTX control & 
operation Unit 1001, without making mo difications to speech encoder 901, 
VAD 903, Tx comfort noise unit 904, speech decoder 1002, speech frame 
substitution unit 1003. Rx comfort noise unit 1004, Tx RSS 93 and Rx RSS 
96. Thus it can be seen herein that very little modifications are required to be 
made in current mobile terminals to implement the present invention. 

In the following section, detailed descriptions will be given respectively 
to the transmission of the proposed first type of, second type of and third 
type of IBD frames via voice chan nel, exemplifying two modified mobile 
terminals supporting IBD frames in GSM full -rate speech traffic. 

I . The method for transmitting three types of IBD frames when the 
hangover procedure is not enabled 

(I) The method for transmitting the first type of IBD frame via voice channel 
when the hangover procedure is not enabled 

Fig. 1 0A and 10B are flow charts illustrating the transmission of the first 
type of IBD frames when the hangover procedure is not enabled, in 
accordance with the present invention . 
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As displayed in Fig. 1 0A, in the mobile terminal as sender, speech 
encoder 901 generates speech frames and sends them to Tx DTX control & 
operation unit 902 (step S10). On receipt of the speech frames firom speech 
encoder 901 , Tx DTX control & operation unit 902 checks the state of the 
VAD flag (step S20). 

1 . If the VAD flag is 1 , Tx DTX control & operation unit 902 sets the SP 
flag to 1 and sends the speech frames from speech encoder 901 to Tx RSS 
93, and then Tx RSS 93 forwards the received speech frames to the mobile 
terminal as receiver via the network system (step S30). 

2. If the VAD flag changes from 1 to 0. tha t means the speech burst is 
over. If the hangover procedure needn 't be enabled at this moment (that 
means N ©lapsed, the number of speech frames sent to Tx RSS since the last 
SID frame updates, is not more than the predefined value), Tx DTX control & 
operation unit 902 sets the SP flag to 0 and checks the state of SendlBDFlag 
(step S40). 

(1) If the SendlBDFlag is 0. that means sending buffer 905 is empty, 
then Tx DTX control & operation unit 902 sends SID frames to Tx RSS 93. 
After transmitting a received SI D frame to the mobile terminal as receiver via 
the network system, Tx RSS 93 stops fransmission and enters into idle state 
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(Step S50). 

(2) If SendlBDFlag is 1 , that means there are the first type of IBD frames 
to be transmitted in sending buffer 905, then Tx DTX control & operation unit 
902 sends the IBD frames from sending buffer 905 to Tx RSS 93. (a) If the 

5 VAD flag is still zero after all IBD frames are sent out, then Tx DTX control & 
operation unit 902 sends SID frames to Tx RSS 93. After transmitting t he 
received IBD frames and a SID frame to the mobile terminal as receiver via 
the network system, Tx RSS 93 stops transmission and enters into idle state, 
(b) If the VAD flag changes to 1 when the IBD frames are transmitted, it 

10 indicates there is new speec h burst to be transferred, then Tx DTX control & 
operation unit 902 stops sending IBD frames but begins to send new speech 
frames of the speech burst, to the mobile terminal as receiver via the network 
system. Tx RSS 93 sends the received IBD frames and ne w speech frames 
of the speech burst to the mobile terminal as receiver. Since no SID frame is 

15 received, Tx RSS 93 will not stop transmission and switch into idle state 
(step S60). 

As illustrated in Fig. 1 0B, in the mobile terminal as receiver, Rx RSS 96 
checks whether it has received frames sent from the mobile terminal as 
sender via the network system (step S100). If any frame is received, it will be 
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sent to Rx DTX control & operation unit 1001. On receipt of a frame from Rx 
RSS 96, Rx DTX control & operati on unit 1001 cinecks whether the SID code 
word of the frame is zero (step S1 1 0). 

1. If the SID code word is zero, it indicates the frame is a SID frame, 
then Rx DTX control & operation unit 1001 sends this frame to Rx c omfort 
noise unit 1004 for processing, to generate background noise (step S120). 

2. If the SID code word is not zero, Rx DTX control & operation unit 
1001 checks whether each bit as the IBD code word prefix in the SID code 
word is zero (step S130). (1) If the IBD code word prefix is not zero, it 
indicates the frame is a speech frame, then Rx DTX control & operation unit 
1001 sends the speech frame to speech decoder 1002, to generate speech 
signals (step S140). (2) If the IBD code word prefix is zero, it indicates the 
frame is an IBD frame, then Rx control & operation unit 1001 stores the IBD 
frame into receiving buffer 1005. and sets ReceivelBDFlag to 1 to notify 
upper-layer applications that there are IBD frames received (step SI 50), and 
meanwhile sends one of the latest received SID frames to Rx comfort noise 
unit 1004 to generate background noise (step SI 60). 

(II) The method for transmitting the second type of IBD frames via 
voice channel when the hangover procedure is not enabled 
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Fig.11A and Fig.11B are flow charts illustrating the transmiss ion of the 
secxind type of IBD frames when the hangover procedure is not enabled in 
accordance with the present invention. 

As Fig.11A illustrates, in the mobile terminal as sender, speech encoder 
901 generates speech frames and sends them to Tx DTX control & operation 
unit 902 (step S210). On receipt of the speech frames from speech encoder 
901. Tx DTX control & operation unit 902 checks the state of the VAD flag 
(step S220). 

1 . If the VAD flag is 1 , Tx DTX control & operation unit 902 sets the SP 
flag to 1 and sends the speech frames from speech encoder 901 to Tx RSS 
93. Then Tx RSS 93 fonfl^ards the received speech flames to the mobile 
terminal as receiver via the network system (step S230). 

2. If the VAD flag changes from 1 to 0, that means the speech burst is 
over. If the hangover procedure needn 't be enabled at this moment (that is, 
Neiapsed, the number of speech frames sent to the Tx RSS since the latest SID 
frame updates, is not greater than the predefined value), Tx DTX control & 
operation unit 902 sets the SP flag to 0 and checks the state of SendlBDFlag 
(step S240). 

(1) If SendlBDFlag is zero, it indicates sending buffer 905 is empty, then 
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Tx DTX control & operation unit 902 sends SID frames to Tx RSS 93. After 
sending a received SID frame to the mobile terminal as sender via the 
network system, Tx RSS 93 stops transmission and switches to Idle state 
(step S250). 

(2) If the SendlBDFlag is 1, it indicates there are the second type of IBD 
frames to be sent in sending buffer 905, then Tx DTX control & opera tion unit 
902 forwards the IBD frames stored in sending buffer 905 to Tx RSS 93. (a) 
If the VAD flag is still zero after all IBD frames in sending buffer 905 are sent 
out, Tx DTX control & operation unit 902 then sends SID frames to Tx RSS 
93. After sending the received IBD frames and a SID firame to the mobile 
terminal as receiver, Tx RSS 93 stops transmission and switches to idle state, 
(b) If the VAD flag changes to 1 during IBD frame transmission procedure, it 
indicates there is new speech burst to be tr ansferred, then Tx DTX control & 
operation unit 902 stops sending IBD frames but begins to send the speech 
frames of the new speech burst to Tx RSS 93. Tx RSS 93 sends the 
received IBD frames and the speech frames of the new speech burst to the 
mobile terminal as receiver. Since no SID frame is received, Tx RSS 93 will 
not stop transmission and switch to idle state (step S260). 

As displayed in Fig.11B, in the mobile terminal as receiver, Rx RSS 96 
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checks whether it has received frames sent from the mobile t erminal as 
sender via the network system (step S300). If any frame has been received, 
it will be sent to Rx DTX control & operation unit 1001. On receipt of a frame 
from Rx RSS 96, Rx DTX control & operation unit 1001 checks whether the 
SID code word of the frame is zero (step S310). 

1. If the SID code word equals to zero, it indicates that the frame is a 
SID frame, then Rx DTX control & operation unit 1001 sends the SID frame 
to Rx comfort noise unit 1004 for processing, to generate background noise 
(step S320). 

2. If the SID code word is not zero, Rx DTX control & operation unit 
1001 checks the value of each bit as the IBD code word in the frame, i.e. the 
value of the IBD code word prefix and that of the IBD code word postfix (step 
S330). 

(1) If the IBD code word prefix is not zero and the postfix is zero. It 
indicates the frame is the second type of IBD frame, thus Rx DTX control & 
operation unit 1001 stores the IBD frame in receiving buffer 1005, and sets 
ReceivelBDFlag to 1, to notify upper -layer applications that there are some 
IBD frames received (step S350). Then one of the latest received SID frames 
is sent to Rx comfort noise unit 1004 to generate the background noise (step 
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S360). 

(2) If the condition in (1) can 't be satisfied, i.e. the IBD code word prefix 
is not zero and the postfix is zero, it indicates the frame is a speech frame. 
Rx DTX control & operation unit 1001 sends the speech frame to speech 
decoder 1002 to generate speech signals (step S340). 

(Ill) The method for transmitting the third ty pe of IBD frames via 
voice channel when the hangover procedure is not enabled 

Fig.12A and Fig. 128 are flow charts illustrating the transmission of the 
third type of IBD frames when the hangover procedure is not enabled. 

As displayed in Fig.12A, in the mobi le terminal as sender, speech 
encoder 901 sends the generated speech frames to Tx DTX control & 
operation unit 902 (step S410). On receipt of the speech frames from speech 
encoder 901. Tx DTX control & operation unit 902 checks the state of the 
VAD flag (step S420). 

1 . If the VAD flag is found to be 1 , Tx DTX confrol & operation unit 902 
sets the SP flag to 1 and sends the speech frames from speech encoder 901 
to Tx RSS 93, then Tx RSS 93 sends the received speech frames to the 
mobile terminal as receiver v ia the network system (step S430). 
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2. If the VAD flag changes from 1 to 0, it means the speech burst is over. 
If hangover procedure needn 't be enabled at this moment (that is, N elapsed, 
the number of speech frames sent to the Tx RSS since the latest SID fr ame 
updates, is not greater than the predefined value), Tx DTX control & 
operation unit 902 sets the SP flag to 0 and then checks the state of 
SendlBDFlag (step S440). 

(1) If SendlBDFlag is 1, it indicates there are IBD frames to be 
transmitted in sending buffer 905. thus Tx DTX control & operation unit 902 
sends one of the third type of IBD frames in sending buffer 905 to Tx RSS 93. 
The IBD word of the third type of IBD frame contains the SID code word and 
the value of the SID code word is zero, so Tx RSS 93 will stop transmission 
and switch to idle state after sending the IBD frame to the network system as 
a SID frame (step S450). 

(2) If SendlBDFlag is 0, it indicates that sending buffer 905 is empty, 
thus Tx DTX control & operation unit 902 sends SID fram es to Tx RSS 93. 
After sending the received SID frames to the mobile terminal as receiver via 
the network system, Tx RSS 93 stops transmission and switches to idle state 
(step S460). 

As illustrated in Fig.12B, in the mobile terminal as receiver, Rx RSS 96 
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Checks whether it has received frames sent by the mobile terminal as sender 
(step S500). If any frame is received, it will be sent to Rx DTX control & 
operation unit 1001. On receipt of a frame firom Rx RSS 96, Rx DTX control 
& operation unit 1001 checl<s wh ether the SID code word of the frame is zero 
(step S510). 

1. If the SID code word is not zero, it indicates the frame is a speech 
frame, thus Rx DTX control & operation unit 1001 sends the speech frame to 
speech decoder 1002 for decoding (step S520). 

2. If the SID code word is zero, Rx DTX control & operation unit 1001 
checks whether the extended IBD code word in the frame is zero (step S530). 
(1) If the extended IBD code word is zero, it indicates the frame is a SID 
frame, then Rx DTX control & operation u nit 1001 sends the SID frame to Rx 
comfort noise unit 1004 to generate background noise (step S550). (2) If the 
extended IBD code word is not zero, it indicates the frame is an IBD frame, 
thus Rx DTX control & operation unit 1001 stores the IBD frame in re ceiving 
buffer 1005 and sets ReceivelBDFlag to 1, to notify upper-layer applications 
there are IBD frames received (step S540). Then, one of the latest received 
SID frames is sent to Rx comfort noise unit 1004 to generate background 
noise (step S560). 
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As illustrated in the above descriptions in connection with the flow 
charts in Fig.lOA and Fig.lOB, Fig.11A and Fig.11B, Fig.12A and Fig.12B, 
there are three differences among the transmission of the first type of, the 
second type of and the third type of IBD frames via voice channel when the 
hangover procedure is not enabled. (1) In the mobile terminal as sender, the 
three types of IBD frames are constructed differently, so the functional blocks 
in upper-layer applications for forming the three types of IBD fir ames should 
be different Furthermore, the SID code word of zero value is Included in the 
IBD code word of the third type of IBD frames, thus only one IBD fi^ame can 
be transferred and the Tx RSS will be turned off every time the speech burst 
is over, if non-speech data are transferred by using the third type of IBD 
firames. Whereas several consecutive IBD frames can be transferred through 
adopting the first type of or the second type of IBD frames every time the 
speech burst is over, when the hangover proced ure is not enabled. As long 
as no new speech burst is generated, all IBD frames to be transmitted can be 
sent out, and then a SID frame is sent for closing the Tx RSS. Except this, 
other functional blocks for transferring the three types of IBD frames are the 
same in the mobile terminal as sender. (2) In the mobile tenninal as receiver, 
the three types of IBD frames are constructed differently, so the functional 
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blocks in the Rx DTX control & operation unit for identifying the three types of 
IBD frames should be different too, and accordingly the functional blocks in 
upper-layer applications for decoding the three types of IBD frames should 
be of some difference. Except this, other functional blocks for processing the 
three types of IBD frames are the same i n the mobile terminal as receiver. 

Fig. 13 illustrates an embodiment of the present invention, for 
transmitting the first type of IBD frames when the hangover procedure is not 
enabled, which is equally applicable to the transmission of the second type, 
of IBD frames. As shown in the figure, the mobile terminal as sender sends 
two speech bursts, with the length of each speech burst as three speech 
firames. 

As to the first speech burst, the VAD flag and SP flag are both 1 during 
the burst period, so Tx RSS 93 transmits the three speech frames of the 
speech burst to the mobile terminal as receiver via the network system. 
When the speech burst is over, i.e. the VAD flag changes from 1 to 0, N ©lapsed, 
the number of speech frames sent to Tx RSS 93 since the latest SI D firanie 
updates, is not more than the predefined threshold of the hangover 
procedure, so Tx DTX control & operation unit 902 will not enable the 
hangover procedure and the SP flag is set to 0 meanwhile. At time t o during 
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the first speech burst, the upper -layer application has stored three IBD 
frames (IBD1, IBD2 and IBD3) in sending buffer 905, so Tx DTX control & 
operation unit 902 first sends the IBD frames stored in sending buffer 905 to 
Tx RSS 93 when said speech burst is over. After the above three IBD frames 

5 are all sent out, the VAD flag is still 0. thus Tx DTX control & operation unit 
902 continues to send SID frames to Tx RSS 93. After sending the received 
three IBD frames and one SID frame to the mobile terminal as receiver, Tx 
RSS 93 stops transmis sion and switches to idle state. In the mobile terminal 
as receiver, Rx RSS 96 correspondingly receives three speech firames ,iBD1, 

10 IBD2. IBD3 and a SID frame sent from the mobile terminal as sender via the 
network system, and then sends them to Rx DTX cont rol & operation unit 
1001. First, Rx DTX control & operation unit 1001 sends the three received 
speech frames to speech decoder 1002 for decoding, according to the 
receiving sequence order. Then, Rx DTX control & operation unit 1001 

15 stores IBD1, IBD2 and IBD3 in receiving buffer 1005, and sets the 
ReceivelBDFlag to 1 to notify upper -layer applications there are some IBD 
frames received. Afterwards, the previously received SID frame is sent to Rx 
comfort noise unit 1004 three times to generate background noi se (note that 
the three SID frames are the same). Finally, the received SID frame is sent to 
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Rx comfort noise unit 1004 to generate background noise. 

As to the second speech burst, the VAD flag and SP flag are both 1 
during the burst period, thus Tx RSS 93 sends the three speech frames of 
the speech burst to the mobile terminal as receiver via the network system. 
When the speech burst is over, i.e. the VAD flag changes to 0, N elapsed, the 
number of speech frames sent to Tx RSS 93 since the latest SID frame 
updates, is not greater than the threshold predefined by the hangover 
procedure, thus Tx DTX control & operation unit 902 will not enable the 
hangover procedure and set the SP flag to 0. At time t i during the speech 
burst, a frame IBD4 with respect to the second type of IBD flrame is stored in 
sending buffer 905, so the Tx DTX control & operation unit 902 first sends 
IBD4 to Tx RSS 93 when the speech burst is over. After IBD4 is sent out, the 
VAD flag is still 0, so Tx DTX control & operation unit 902 contin ues to send 
SID flrames to Tx RSS 93. After sending the received IBD4 and an SID frame 
to the mobile terminal as receiver via the network system, Tx RSS 93 stops 
transmission and switches to idle state. In the mobile terminal as receiver, Rx 
RSS 96 receives three speech frames of the speech burst, IBD4 and a SID 
frame from the mobile temiinal as sender via the network system, and then 
hands them to Rx DTX control & operation unit 1001. According to the 
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receiving order, Rx DTX control & operation unit 1001 fi rst sends the tliree 
received speech frames to speech decoder 1002 for decoding; then stores 
IBD4 in receiving buffer 1005 and sets ReceivelBDFlag to 1 to notify 
upper-layer applications that some IBD frames are received; afterward sends 
a previously received SID frame to Rx comfort noise unit 1004 to generate 
background noise; and finally, sends the received SID frame to Rx comfort 
noise unit 1 004 to generate background noise. 

Fig. 14 illustrates an embodiment of the present invention, for 
transmitting the third type of IBD frames when the hangover procedure is not 
enabled. As displayed in the figure, the mobile terminal as sender sends out 
two speech bursts and the length of each speech burst is three speech 
frames. 

With regard to the first speech burst, the VAD flag and SP flag are both 1 
during the burst period, so Tx RSS 93 sends the three speech frames of the 
speech burst to the mobile terminal as receiver via the network system. 
When the speech burst is over, i.e. the VAD flag changes from 1 to 0, N ©lapsed, 
the number of speech frames sent to Tx RSS 93 since the latest SID frame 
updates, is not greater than the threshold predefined by the hangover 
procedure, so Tx DTX control & operation unit 902 will not enable the 
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hangover procedure and the SP flag is s et to 0 meanwhile. At time to during 
the first speech burst, upper -layer applications are required to encapsulate 
the data or signaling messages to be transmitted to the mobile terminal as 
receiver into a frame with respect to the third type of IBD frame a nd stores it 
in sending buffer 905. so Tx DTX control & operation unit 902 takes the IBD 
frame out and sends it to Tx RSS 93. The SID code word of zero value is 
included in the third type of IBD frame, thus Tx RSS 93 sends the IBD frame 
to the mobile terminal as receiver via the network system as a SID frame, 
then stops transmission and switches to idle state. In the mobile terminal as 
receiver, Rx RSS 96 receives three speech frames and an IBD frame sent by 
the mobile terminal as sender via the network sys tem, and hands them to Rx 
DTX confrol & operation unit 1001. Rx DTX confrol & operation unit 1001 first 
sends the three received speech frames to speech decoder 1002 for 
decoding; then stores the received third type of IBD frame in receiving buffer 
1005. and sets ReceivelBDFlag to 1 to notify upper -layer applications about 
the arrival of IBD frames; and meanwhile sends a previously received SID 
frame to Rx comfort noise unit 1004 to generate background noise. 

With regard to the second speech burst, the VAD flag and SP flag are 
both 1 during the burst period, so Tx RSS 93 sends the three speech frames 



wo 2005/048620 PCT/IB 2004/0523 19 

44 

of the speech burst to the mobile terminal as receiver directly. When the 
speech burst is over, i.e. the VAD flag changes from 1 to 0, N erapsed» the 
number of speech frames sent to Tx RSS 93 since the latest SID frame 
updates, fs not greater than the threshold predefined by the hangover 

5 procedure, thus Tx DTX control & operation unit 902 will not enable the 
hangover procedure and the SP flag is set to 0 meanwhile . Since sending 
buffer 905 is empty, Tx DTX control & operation unit 902 sends SID frames 
previously received to Tx RSS 93. After sending a received SID frame to the 
mobile terminal as receiver via the network system, Tx RSS 93 stops 

10 transmission and switc hes to idle state. In the mobile terminal as receiver. Rx 
RSS 96 receives three speech frames and a SID frame sent from the mobile 
terminal as sender via the network system, and then sends them to Rx DTX 
control & candle unit 1001.Rx DTX control & operatic n unit 1001 first sends 
the three received speech frames to speech decoder 1002 for decoding, and 

15 then sends the received SID frame to Rx comfort noise unit 1004 to generate 
background noise. 

II . The method for transmitting three types of IBD frames when the 
hangover procedure is enabled 
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(I) The method for transmitting the first type of IBD frames via voice 
channel when the hangover procedure is enabled 

Fig.15A and Fig.lSB are flow charts illustrating the transmission of the 
first type of IBD frames in accordance with the present invention when 
hangover procedure is enabled. 

As displayed in the Fig.15A, in the mobile temiinal as sender, speech 
encoder 901 sends the generated speech frames to Tx DTX control & 
operation unit 902 (step S65). On receipt of the speech firames, Tx DTX 
control & operation unit 902 checks the state of the VAD flag (step S70). 

1. If the VAD flag is 1. Tx DTX control & operation unit 902 sets the SP 
flag to 1 and sends the speech frames from speech encoder 901 to Tx RSS 
93. Then, Tx RSS 93 sends the received speech frames to the mobile 
terminal as receiver via the network system (step 875). 

2. If the VAD flag changes flrom 1 to 0, it means the speech burst is over. 
If the hangover procedure needs to be enabled at this moment (N ©lapsed, the 
number of speech firames sent to Tx RSS 93 since the latest SID frame 
updates, is greater than the predefined threshold), Tx DTX control & 
operation unit 902 continues to set the SP flag to 1 and checks the state of 
SendlBDFlag (step S80). 



wo 2005/048620 



PCT/IB2004/052319 



46 

(1) If SendlBDFlag is 0, it indicates sending buffer 905 is empty, thus 
Tx DTX control & operation unit 902 sends N (N is the number of silence 
speech frames for computing the SID frame) silence speech frame s from 
speech encoder 901 to Tx RSS 93, then sets the SP flag to 0. and 
meanwhile sends the new SID frame generated according to the N silence 
speech frames to Tx RSS 93. After sending the received silence speech 
frames and a SID firame to the mobile termin al as receiver, Tx RSS 93 stops 
transmission and switches to idle state (step S85). 

(2) If SendlBDFlag is 1 it indicates sending buffer 905 has IBD frames 
to be sent, then Tx DTX control & operation unit 902 sends IBD frames with 
the same number as the si lence speech frames for computing the SID frame, 
to Tx RSS 93 (If the IBD frames are not enough, silence speech frames from 
speech encoder 901 will come as complement). 

(a) If the VAD flag is still zero after said IBD frames are all sent out, Tx 
DTX control & operation unit 902 sets the SP flag to 0 and then sends a SID 
frame to Tx RSS 93. wherein the SID frame is computed according to the N 
silence speech frames. After sending the received IBD frames (or IBD 
frames and silence speech frames) and a SID fram e to the mobile terminal 
as receiver via the network system, Tx RSS 93 stops transmission and 
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switches to idle state. 

(b) If the VAD flag changes to 1 when the IBD frames (or silence speech 
frames) are transmitted to Tx RSS 93, it indicates there is a new speech 
burst to be transferred, thus Tx DTX control & operation unit 902 stops 
sending IBD frames (or silence speech frames) but begins to send the 
speech frames of the new speech burst, to Tx RSS 93. Tx RSS 93 sends the 
received IBD frames (or IBD frames and silence speech frames) and the 
speech frames of the new speech burst to the mobile terminal as receiver. 
Since no SID frame is received, Tx RSS 93 will not stop fransmission and 
switch to idle state (step S90). 

As shown in Fig.15B, in the mobile termin al as receiver, Rx RSS 96 
checks whether it has received frames sent by the mobile terminal as sender 
via the network system. If any frame is received, it will be sent to Rx DTX 
control & operation unit 1001. On receipt of a frame from Rx RSS 96. Rx 
DTX confrol & operation unit 1001 checks whether the SID code word of the 
frame is zero (step S170). 

1. If the SID code word equals to zero, it indicates the frame is a SID 
frame, then Rx DTX control & operation unit 1001 sends the SID frame to Rx 
comfort noise unit 1004 for processing, to generate background noise (step 
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S175). 

2. If the SID code word is not zero, Rx DTX control & operation unit 1 001 
checks whether the IBD code word prefix of the frame is zero (step S180). (1) 
If the IBD code word prefix is not z ero, it indicates that the frame is a speech 
frame, then Rx DTX control & operation unit 1001 sends the speech frame to 
speech decoder 1002 to generate speech signals (step S185). (2) If IBD 
code word prefix is zero, it indicates that the frame is an IBD f rame. then Rx 
DTX control & operation unit 1001 stores the IBD frame in receiving buffer 
1005. and sets Receive! BDFlag to 1 to notify upper-layer applications that 
some IBD frames are received (step SI 90). and aftenA^ard sends one of the 
latest received SID frames to Rx comfort noise unit 1004 to generate 
background noise (step SI 95). 

(II) The method for transmitting the second type of IBD frames via 
voice channel when hangover procedure is enabled 

Fig.lSA and Fig.16B illustrate the flow charts of transmitt ing the second 
type of IBD frames when hangover procedure Is enabled. 

As illustrated in Fig.16A, in the mobile terminal as sender, speech 
encoder 901 generates speech frames and sends them to Tx DTX control & 
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operation unit 902 (step S265). On receipt of t he speech frames from speech 
encoder 901 , Tx DTX control & operation unit 902 checl^s the state of the 
VAD flag (step S270). 

1 . If the VAD flag is 1, Tx DTX confrol & operation unit 902 sets the SP 
flag to 1 and sends the speech frames from speech encoder 9 01 to Tx RSS 
93. Then, Tx RSS 93 sends the received speech frames to the mobile 
terminal as receiver via the network system (step S275). 

2. If the VAD flag changes from 1 to 0, it indicates the speech burst is. 
over. If hangover procedure is required to be enabled (that is, N ©lapsed, the 
number frames sent to Tx RSS 93 since the last SID frame updates, is 
greater than the threshold predefined by the hangover procedure). Tx DTX 
control & operation unit 902 enables the hangover proceduris, continues to 
set the SP flag to be 1, and checks the state of SendlBDFlag (step S280). 

(1) If SendlBDFlag is 0, i.e. sending buffer 905 is empty, Tx DTX control 
& operation unit 902 first sends N (N is the number of silence speech frames 
for computing the SID frame) silence sp eech frames from speech encoder 
901 to Tx RSS 93. and then sets the SP flag to 0, and sends the new SID 
frame generated according to the N silence speech frames to Tx RSS 93. 
After sending the received silence speech frames and a SID frame to the 
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mobile terminal as receiver, Tx RSS 93 stops transmission and switches to 
idle state (step S285). 

(2) If SendlBDFlag is 1, i.e. there are IBD frames to be transmitted in 
sending buffer 905, Tx DTX control & operation unit 902 sends IBD frames 
with the same number as the silence speech frames for computing the SID 
frame, to Tx RSS 93 (If the IBD frames are not enough, silence speech 
frames from speech encoder 901 will come as complement). 

(a) If the VAD flag is still zero after said IBD frames are all sent out, Tx 
DTX control & operation unit 902 sets the SP flag to 0, and then sends a SID 
frame to Tx RSS 93, wherein the SID frame is generated according to the N 
silence speech frames. After sending the received IBD frames (or IBD 
frames and silence speech frames) and a SID frame to the mobile terminal 
as receiver via the network system, Tx RSS 93 stops transmission and 
switches to idle state. 

(b) If the VAD flag changes to 1 when IBD frames (or silence speech 
frames) are transmitted to Tx RSS 93, it indicates there Is a new speech 
burst to be transferred. Tx DTX control & operation unit 902 stops sending 
IBD frames (or silence speech frames) but begins to send the speech frames 
of the new speech burst to Tx RSS 93. Tx RSS 93 sends the received IBD 
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frames (or IBD frames and SID frames) and the speech frames of the new 
speech burst to the mobile terminal as receiver. Since no SID frame is sent, 
Tx RSS 93 will not stop fransmisslon and switch to idle state (step S290). 

As shown in Fig.lSB, in the mobile tenminal as receiver , Rx RSS 96 
checks whether it has received frames from the mobile terminal as sender 
via the network system (step S365). If any frame is received, it will be sent to 
Rx DTX control & operation unit 1001. On receipt of a frame from Rx RSS 96, 
Rx DTX control & operation unit 1001 checks whether the SID code word of 
the frame is zero (step S370). 

1. If the SID code word is zero, it indicates the frame is a SID frame, 
then Rx DTX control & operation 1001 sends the SID frame to Rx c omfort 
noise unit 1004 for processing, to generate background noise (step S375). 

2. If the SID code word is not zero, Rx DTX confrol & operation unit 1001 
checks the value of the IBD code word of the frame, i.e. the value of the IBD 
code word prefix and that of the IBD code word postfi x in the frame (step 
S380). 

(1) If the IBD code word prefix is not zero and the IBD code word postfix 
is zero, it indicates the frame is an IBD frame. Rx DTX control & operation 
unit 1001 stores the IBD frame in receiving buffer 1005. and sets 
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ReceivelBDFlag to 1 to notify upper -layer applications that some IBD frames 
are received (step S390). and then sends one of the latest received SID 
frames to Rx comfort noise unit 1004 to generate background noise (step 
S396), 

5 (2) If the condition in (1) can't be satisfied, i.e. the condition can't be 

satisfied that the IBD code word prefix is not zero and the postfix is zero, it 
indicates the frame is a speech frame. Rx DTX control & operation unit 1001 
sends the speech frame to speech decoder 1002 to generate speech signals 
(step S385). 

10 (III) The method for transmitting the third type of IBD frames via voice 
channel when hangover procedure Is enabled 

Fig.17A and Fig.17B are flow charts illustrating the transmission of frie 
third type of IBD firames in the present invent ion when hangover procedure Is 
enabled. 

15 As shown In Fig.lTA, in the mobile terminal as sender, speech encoder 

901 sends the generated speech frames to Tx DTX control & operation unit 

902 (step S465). On receipt of the speech frames from speech encoder 901 , 
Tx DTX control & operation unit 902 checks the state of the VAD flag (step 
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S470). 

1. If the VAD flag Is 1, Tx DTX control & operation unit 902 sets the SP 
flag to 1 and sends the speech frames from speech encoder 901 to Tx RSS 
93. Then, Tx RSS sends the r ecelved speech frames to the mobile terminal 
as receiver via the network system (step S475). 

2. If the VAD flag changes from 1 to 0, it means the speech burst is over. 
If the hangover procedure is required to be enabled at this moment (that is, 
Neiapsed, the number of speech frames sent to Tx RSS 93 since the last SID 
frame updates, is greater than the threshold predeflned by tiie hangover 
procedure), Tx DTX control & operation unit 902 continues to set the SP flag 
to 1 and checks the state of SendlBDFlag (s tep S480). 

(1) If SendlBDFlag is zero, it indicates sending buffer 905 is empty. Tx 
DTX control & operation unit 902 first sends N (N is the number of silence 
speech frames for computing the SID frame) consecutive silence speech 
frames from speech encoder 901 to Tx RSS 93, then sets the SP flag to 0 
and sends the new SID frame generated according to the N silence speech 
frames to Tx RSS 93. After sending the received silence speech frames and 
a SID frame to the mobile terminal as receiver via the network system, Tx 
RSS 93 stops transmission and switches to idle state (step S490). 
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(2) If SendlBDFlag is IJt indicates sending buffer 905 has IBD frames 
to be transmitted, then Tx DTX control & operation unit 902 first sends IBD 
frames of not more than N. to Tx RSS 93 (if the IBD frames are not enough, 
silence speech frames from speech encoder 901 will come as complement). 

(a) If the VAD flag is still zero after the IBD frames are all sent out, Tx 
DTX control & operation unit 902 sets the SP flag to 0 and then sends a SID 
frame to Tx RSS 93, wherein the SID frame is computed according to the N 
silence speech frames. After sending the received IBD frames (or IBD 
frames and silence speech frames) and a SID frame to the mobile terminal 
as receiver, Tx RSS 93 stops tr ansmission and switches to idle state. 

(b) If the VAD flag changes to 1 when the IBD frames (or silence 
speech frames) are transmitted, it indicates that there is a new speech burst 
to be transferred. Tx DTX control & operation unit 902 stops sending IBD 
frames (or silence speech frames) but begins to send the speech frames of 
the new speech burst to Tx RSS 93. Tx RSS 93 sends the received IBD 
frames (or IBD frames and silence speech frames) and the speech frames of 
the ne>A^ speech burst to the mobile termin al as receiver. Since no SID frame 
is sent, Tx RSS 93 will not stop transmission and switch to idle state (step 
S485). 
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As Fig.17B shows, in the mobile terminal as receiver, Rx RSS 96 
checks whether it has receives frames from the mobile terminal as sender 
via the network system (step S565). If any frame is received, it will be sent to 
Rx DTX control & operation unit 1001 . On receipt of a frame from Rx RSS 96, 
Rx DTX control & operation unit 1001 checks whether the SID code word of 
the frame is zero (step S5 70). 

1. If the SID code word is not zero, it indicates the frame is a speech 
frame, then Rx DTX control & operation unit 1001 sends this speech frame to 
speech decoder 1002 for decoding (step S575). 

2. If the SID code woi'd is zero, Rx DTX control & operati on unit 1001 
checks whether the extended IBD code word of the frame is zero (step S580). 
(1) if the extended IBD code word is zero, it indicates the frame is a SID. 
frame, then Rx DTX control & operation unit 1001 sends the SID frame to Rx 
comfort noise unit 1004 to generate background noise (step S590). (2) If the 
extended IBD code word is not zero, it indicates the frame is a n IBD frame, 
thus Rx DTX control & operation unit 1001 stores the IBD frame into 
receiving buffer 1005, and sets ReceivelBDFlag to 1 to notify upper -layer 
applications that some IBD frames are received (step S585), and then sends 
one of the latest received SID frames to Rx comfort noise unit 1004 to 



wo 2005/048620 



PCT/IB2004/052319 



56 

generate background noise (step S595). 

Similar to the above cases when the hangover proc edure is not enabled 
as illustrated in Fig. 1 0A and Fig. 1 0B. Fig.11 A and Fig.HB. Fig.12A and 
Flg.12B, the flow charts in Fig.lSA and Fig.15B, Fig.16A and Fig.16B. 
Fig.17A and Fig.17B elaborate the transmission of the three types of IBD 
frames when hangover procedure is enabled. The main differences between 
them can be summarized as: (1) In the upper -layer applications of the mobile 
terminal as sender, the functional blocks for forming the three types of IBD 
frames are different Except this, other functiona I blocks for transferring the 
three types of IBD frames are the same. (2) In the Rx DTX control & 
operation unit of the mobile terminal as receiver, the functional blocks for 
identifying the three types of IBD frames are different, and correspondingly 
the functional blocks in upper -layer applications for decoding the three types 
of IBD frames should be of some difference too. Except this, in the mobile 
terminal as receiver, other functional blocks for processing the three types of 
IBD frames are the same. 

Fig.18 illustrates an embodiment of the proposed method for 
transmitting the first type of IBD frames, and the procedure as shown in the 
figure is equally applicable to the second and third types of IBD frames. 
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As the figure displays, in the mobile terminal as sender, the VAD flag 
and SP flag are both 1 during the burst period, so Tx RSS 93 sends the 
speech frames of the speech burst to the mobile terminal as receiver via the 
network system. When the speech burst is over, i.e. the VAD flag changes to 
5 0, Neiapsed» the number of speech frames sent to Tx RSS 93 since the latest 
SID frame updates, is greater than the threshold predefined by the hangover 
procedure, so Tx DTX control & operation unit 902 will enable the hangover 
procedure and continues to set the S P flag to 1 meanwhile. Since there are 
only two IBD frames, IBD1 and IBD2 in sending buffer 905, Tx DTX control &- 

10 operation unit 902 sends IBD1 and IBP2 and two silence speech frames 
from speech encoder 901 to Tx RSS 93, When the four frames are ail sent 
out, the VAD flag is still zero, thus Tx DTX control & operation unit 902 sets 
the SP flag to 0, and then sends SID frames (e.g. SID r+i and SIDk4.2in the 
figure) to Tx RSS 93. After sending the speech firames of the speech burst, 

15 IBD1. IBD2. two silence speech frames and SIDk+i received to the mobile 
terminal as receiver via the network system, Tx RSS 93 stops transmission 
and switches into idle state. 

At the mobile terminal as receiver. Rx RSS 96 receives the speech 
frames of the speech burst, IBD1, IBD2, two silence speech frames, and 
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SIDk+1 from the mobile temiinal as sender via ttie network system, and then 
sends them to Rx DTX control & operation unit 1001. Rx DTX control & 
operation unit 1001 first sends the received speech frames in receiving order, 
to speech decoder 1002 for decoding; then stores IBD1 and IBD2 into 
receiving buffer 1005, and sets ReceivelBDFlag to 1 to notify upper -layer 
applications that some IBD frames are received; afterward sends two 
previously received SID frames (e.g. SID kin the figure) to Rx comfort noise 
unit 1004; next sends the two received silence speech frames to speech 
decoder 1002 for decoding; and finally sends SID k+i to Rx comfort noise unit 
1004 to generate background noise. 

The above section describes the procedure for transferring IBD frames 
via voice channel between two mobile terminals both supporting IBD frames 
in GSM full-rate speech traffic, in conjunction with Fig.lOA to Fig. 18. In the 
following section, an embodiment will go to a mobile terminal supporting IBD 
frames and another mobile terminal not supporting IBD frames, to describe 
the transmission of IBD frames between them. 

It's assumed that mobile terminal MSI supports IBD frames while 
mobile terminal MS2 doesn 't. If MS2 receives a frame with respect to the fi rsf 
type of IBD frames from MSI , it will regard the IBD frame as a speech frame 
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and send it directly to the speech decoder for decoding because the SID 
code word of the first type of IBD frame is not zero. If the speech decoder 
generates felse speech signa Is by using this IBD frame, the listener may feel 
very uncomfortable, because the false speech signal might contain very high 
energy and mismatch with the speech signals generated from other normal 
speech frames. 

If MS2 receives a frame with respect to the second type of IBD frame 
from MS1. It will also regard the IBD frame as a speech frame and send it 
directly to the speech decoder for decoding to generate false speech signals 
because the SID code word in the IBD frame is not zero. Fortunately, the 24 
bits for Block Amplitude parameter are defined as the IBD code word postfix 
in the second type of IBD frame and all the 24 bits are set to zero, so. the 
speech signal generated according to Block Amplitude parameter of the 
second type of IBD frame has very low energy. Even if the false speech 
signal mismatches with normal speech frames, no significant impact will 
occur on the listener. 

If MS2 receives a frame with respect to the third type of IBD frame from 
MS1, MS2 will store this IBD frame as a new SID frame and generate 
background noise using this IBD frame because the SID code word of the 



wo 2005/048620 



PCT/IB2004/052319 



60 

third type of IBD frame is zero. The third type of IBD frame is not a true SID 
frame, and thus no background noise parameters are included, so the 
generated noise Is false background noise and will only impose 
uncomfortabieness to human hearing in some certain period (for example, 
20ms in GSM/GPRS). 

As stated in the above analysis, the first and third type s of IBD frame will 
produce significant influence upon mobile terminals that don't support IBD 
frames, whereas the second type of IBD firame doesn't have much impact on 
mobile terminals that don't support IBD frames. 

Two solutions are provided in the present invention, to eliminate the 
negative effects of IBD frames on mobile terminals that don't support IBD 
frames. 

Solution 1: IBD frame should carefully use the bits for various 
parameters in the speech frame. For example, if the 24 bits for Block 
Amplitude parameter are set to zero or very low value in the first and third 
types of IBD frames, the IBD frame will have very low energy, and thus won 't 
cause uncomfortabieness to the listener. 

Solution 2: A new communication protocol should be defined. In this 
new protocol, a mobile terminal first send s a probing frame (the probing 
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frame can be of very low energy by setting each bit to appropriate value ) to 
another mobile terminal before sending IBD frames, to check whether said 
another mobile terminal supports IBD frames. If said another mobile terminal 
supports IBD frame, it will sends back a probing response in return to said 
mobile terminal after receiving the probing frame ; otherwise, it just ignores 
the received probing frame. If the mobile terminal sending the probing frame 
receives a probing response from said another mobile terminal, IBD frames 
can be used during communication procedure; otherwise, IBD frames cannot 
be used during communication procedure . 

In the embodiments of the present invention, GSM full -rate speech 
traffic is taken as a example, to describe the methods for transfem'ng the first, 
second and third types of IBD frames via voice channel when the hangover 
procedure is enabled and when the hangover procedure is not enabled. This 
methods can be implemented in software or in hardware. Furthermore, the 
principle a nd implementation procedure can equally extend to other GSM 
speech traffics. 

Beneficial Results of the invention 

As described above, with regard to the method and apparatus for 
transmitting non -speech data via voice channel in the present invention, 
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silence speech frames and SID frames are utilized to transmit IBD frames via 
voice channel, thus the system resources can be saved a lot Furthermore, 
from Fig .9 and its explanation thereof, we can also draw a conclusion that 
little modifications need to be made to current mobile terminals (The 
sending/receiving buffer in the present invention just expands the original 
buffer of current system s, and the added data interface is also very simple. In 
fact, the biggest change occurs in the scheduling and classifying algorithms 
in the DTX control & operation unit, but this is just a modification in software, 
by improving the original algorithm for classifying two types of frames in to a 
modified algorithm capable of classifying /scheduling three types of frames. 
This can be implemented in software or in hardware, and won't bring 
significant and difficult modifications to current systems). Additionally, the 
proposed method can eliminate the negative effects of transmitting IBD 
frames upon mobile terminals not supporting IB D frames, through lowering 
the value of the bits for carrying background noise information in the IBD 
fi^ame or sending probing frame. 

It is to be understood by those skilled in the art that the method and 
apparatus for transmitting non -speech data in voice channel as disclosed in 
tine present invention can be modified considerably without departing from 
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the spirit and scope of the invention as defined by the appended claims . 
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What is claimed is: 

1. A method for a mobile temiinal In mobile communication systems to 
transfer non -speech data over voice channel, comprising steps of: 

(a) detecting \A4iether the speech burst sent to another mobile terminal is 
5 over; 

(b) checking whether there is non-speech data to be sent to said another 
mobile terminal if detecting that the speech burst is over; 

(c) sending at least one non -speech data to said another mobile terminal via 
voice channel if there is non -speech data to be sent. 

10 2. The method as claim 1 , \A4ierein before step (a), further including steps of: 

(i) encapsulating said non -speech data to be sent to said another mobile 
terminal into IBD (In -Band Data) frames; 

(ii) storing the IBD frames in a buffer. 

3. The method as claim 2, wherein t he IBD code word for marking said IBD 
15 firames is composed of the SID (Silence Description) code word for marking 
a SID frame, and the value of each bit selected from the bits that form the 
SID code word, for differentiating the IBD code word from the SID co de word, 
can't be the same as that of each bit for marking the SID code word. 
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4. The method as claim 3. wherein the number of said selected bits is 
required to ensure that the value of each bit forming said IBD code word will 
not appear in speech frames. 

5. The method as claim 2, wherein the IBD code word for marking said IBD 
frames is composed of all bits for carrying Block Amplitude parameter and at 
least one bit selected from the SID code word for marking said a SID frame, 
and the value of each said bit for carrying Block Amplitude parameter is zero, 
and the value of each said bit selected from the SID code word can 't be the 
same as that of each bit for marking the SID code word. 

6. The method as claim 2, wherein the IBD code word for marking said IBD 
frames is composed of the SID code word for marking a SID frame and at 
least one reserved bit not included In the SID code word. 

7. The method as any one of claim 3 to 6, wherein said IBD frames have the 
same length as said SID frame and the speech frames sen t from the mobile 
terminal to said another mobile terminal. 

8. The method as claim 7, wherein said IBD frames are sent during the time 
when said SID frame is supposed to be sent in conventional communications 
if hangover procedure is not enabled. 
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9. The method as claim 7, wherein if hangover procedure is enabled, said 
IBD frames are sent during the time when the silence speech frames are 
supposed to be sent in conventional communications, wherein said silence 
speech frames are used to compute said SID frame . 

10. The method as any one of claim 3 to 5, further comprising steps of: 

(d) pausing sending said IBD frames if detecting that a new speech burst is 
required to transfer to said another mobile terminal while said IBD frames 
have not all been sent out yet; and 

(e) sending the new speech burst to said another mobile terminal. 

1 1 . The method as any one of claim 3, 4 and 6, wherein the value of the bits 
for carrying Block Amplitude parameter in said IBD frames is set to zero or 
nearly zero. 

12. The method as claim 1, before executing step (c), further comprising 
steps of: 

sending a probing frame to said another mobile terminal to check 
whether said another mobile terminal supports IBD frames; and 

sending IBD frames to said another mobile terminal if receiving the 
confirmation response from said another mobile terminal. 
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13. A method for a mobile terminal to transfer non -speech data in voice 
channel, comprising steps of: 

(I) detecting the received frame from another mobile terminal; 
(ii) storing, if the received frame Is a IBD (In -Band Data) frame, the IBD 
5 frame; 

(ill) generating background noise by using the previously received SID firame. 

14. The method as claim 13, wherein the IBD code word for marking said 
IBD frames is composed of the SID (Silence Description) code word for 
marking a SID frame, and the value of each bit selected from the bits that 

10 form the SID code word, for differentiating the IBD code word from tine SID 
code word, can't be the same as that of each bit for marking the SID code 
word. 

15. The method as claim 13, wherein tiie IBD code word for marking said 
IBD frames is composed of all bits for canrying Block Amplitude parameter 

15 and at least one bit selected from the SID code word for marking a SID frame, 
and the value of each said bit for carrying Block Amplitude parameter Is zero, 
and tiie value of each said bit selected from the SID code word can 't be the 
same as that of each bit for marking the SID code word. 
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16. The method as claim 13, wherein the IBD code word for marking said 
IBD frames is comp osed of the SID code word for marking a SID frame and 
at least one reserved bit not included in the SID code word. 

17. The method as any one of claim 14 to 16, wherein said IBD frames have 

5 the same length as said SID frame and the speech frames from said a nother 
mobile terminal. 

18. The method as claim 14, wherein step (i) further includes: 
(a1) checking the SID code word of said received frame; 

(a2) detecting the value of the remained bits after said bits are selected from 
10 the SID code word to judge whethe r said received frame is an IBD frame, if 
the SID code word indicates that said received frame is not a SID frame. 

19. The method as claim 15, wherein step (i) further includes: 
(a2) checking the SID code word of said received frame; 

(b2) detecting the val ue of all bits for carrying Block Amplitude parameter and 
15 the value of said bits selected from the SID code word, to judge whether said 
received frame is an IBD frame, if the SID code word indicates that said 
received frame is not a SID frame. 

20. The methpd as claim 16, wherein step (i) further includes: 
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(a3) checking the SID code word of said received frame; 
(b3) detecting the value of said reserved bits not included in the SID code 
word, to judge whether said received frame is an IBD frame, if the SID co de 
word indicates that said received frame is not a speech frame. 

21 . The method as any one of claim 14 to 16, wherein the value of the bits for 
carrying Block Amplitude parameter in said IBD frames is set to zero or 
nearly zero. 

22. The method as any one of claim 18 to 20, wherein further comprising 
steps of: 

(c) receiving a probing frame from said another mobile terminal; 

(d) returning a confirmation response to said another mobile terminal if the 
mobile terminal supports IBD frames. 

23. A mobile terminal , comprising: 

the first detecting unit, for detecting whether a speech burst sent from 
the mobile terminal to another mobile terminal is over, and checking whether 
there is non-speech data to be sent to said another mobile terminal when 
detecting that the speech burst is over; 

a sending unit, for sending frames to said another mobile terminal; 
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a control unit, for controlling said sending unit to send at least one 
non-speech data frame to said another mobile terminal via voice channel 
when there is non -speech data to be sent. 

24. The mobile terminal in claim 23, further comprising: 

an IBD (In-Band Data) frame generating unit, for encapsulating the 
non-speech data to be sent to said another mobile terminal into IBD frames; 
the first buffer, for storing the gen erated IBD frames. 

25. The mobile terminal in claim 24, wherein the IBD code word for marking 
said IBD frames is composed of the SID code word for marking a SID frame, 
and the value of each bit selected from the bits that form the SID code word, 
for differentiating said IBD code word from said SID code word, can 't be the 
same as that of each bit for marking the SID code word. 

26. The mobile terminal In claim 24, wherein the IBD code word for marking 
said IBD frames is composed of ail bits for carrying Block Amplitude 
parameter and at least one bit selected from the SID code word for marking a 
SID frame, and the value of each said bit for carrying Block Amplitude 
parameter is zero, and the value of each said bit selected from the SID code 
word can't be the same as that of each bit for marking the SID code word. 
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27. The mobile terminal in claim 24, wherein the IBD code word for marking 
said IBD frames Is composed of the SID code word for marking a SID frame 
and at least one reserved bit not included in the SID code word. 

28. The mobile terminal in any one of claim 25 to 27, wherein said IBD 
frames have the same length as said SID firame and the speech frames sent 
from the mobile terminal to said another mobile terminal. 

29. The mobile terminal in claim 28, wherei n if the hangover procedure is not 
enabled, said sending unit sends said IBD firames during the time when said 
SID firame is supposed to be sent in conventional communications. 

30. The mobile terminal in claim 28, wherein if the hangover procedure is 
enabled, said sending unit sends said IBD frames during the time when the 
silence speech frames are supposed to be sent in conventional 
communications, wherein said silence speech frames are used to compute 
said SID frame. 

31 . The mobile terminal in claim 28, further comprising: 

the second detecting unit, for detecting a frame received from said 
another mobile terminal; 

the second buffer, for when the received frame Is a IBD frame, buffering 
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the IBD frame; 

a Rx comfort noise unit, for generating background noise b y using the 
previously received SID frames. 

32. The mobile terminal in claim 31, wherein said detecting unit further 
comprising: 

an IBD frame identifying unit, for checking the SID code word of said 
received firame, and detecting the value of the remained b its after said bits 
are selected from said SID code word to judge whether said received frame 
is an IBD frame when said SID code word indicates that said received frame 
is not a SID frame. 

33. The mobile terminal in claim 31. wherein said detecting unit fu rther 
comprising: 

an IBD frame identifying unit, for checking the SID code word of said 
received frame, and detecting the value of all bits for canrying Block 
Amplitude parameter and the value of said bits selected from said SID code 
word to judge whether said received firame is an IBD frame when said SID 
code word indicates that said received frame is not a SID frame. 

34. The mobile terminal in claim 31, wherein said detecting unit further 
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comprising: 

an IBD frame identifying unit, for checking the SID code word of said 
received frame, and checking the value of said reserved bits not included in 
said SID code word to judge whether said received frame is an IBD frame 
5 when said SID code word Indicates that said received frame is not a speech 
frame. 
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